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Abstract 

The limiting shape of the random Young tableaux associated to the in- 
homogeneous word problem is identified as a multidimensional Brown- 
ian functional. This functional is thus identical in law to the spectrum 
of a certain matrix ensemble. The Poissonized word problem is also 
studied, and the asymptotic behavior of the shape analyzed. 
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1 Introduction 

One of the most remarkable achievements in modern random matrix the- 
ory is the identification by Tracy and Widom [28] of the distribution which 
now bears their names. The Tracy- Widom distribution, denoted by Ftwi 
gives the fluctuations of, the properly centered and normalized, largest eigen- 
value of a matrix taken from the Gaussian Unitary Ensemble (GUE). Since 
then, the fluctuations of some apparently disconnected models have also been 
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shown to be governed by the same hmiting law. For example, this is the case 
of the length of the longest increasing subsequence of a random permutation 
(Baik, Deift and Johansson [1]) as well as of a last-passage time percolation 
problem (Johansson [20], [21]). We refer the reader to [22] for a survey of 
such topics. Along this path, connections between the eigenvalues of some 
random matrices and longest increasing subsequence problems will be further 
studied below. 

Let Xi,X2, Xn be a finite sequence of random variables taking values 
in an ordered set. The length of the longest (weakly) increasing subsequence 
of Xi,X2, ...,Xn, denoted by LIn, is the maximal k < N such that there 
exists an increasing sequence of integers I < ii < 12 < ■ ■ ■ < ik ^ N with 



LJtv = max {k : 3 1 < ii < i2 < ■ ■ ■ < ik N, with Xj^ < Xi^ < • ■ ■ < ^ik} ■ 

When Xi,X2, ■■■jXn is a random permutation of 1,2, ...,N, Baik, Deift 
and Johansson [1] showed that. 



independently and uniformly in an M- letter ordered alphabet, through a 
careful analysis of the exponential generating function of LI^, Tracy and 
Widom [30] gave the limiting distribution of LI^ (properly centered and 
normalized) as that of the largest eigenvalue of a matrix drawn from the 
M X M traceless GUE. This was extended to the non-uniform setting by Its, 
Tracy and Widom ([IB], [IH])- 

A very efficient method to study longest increasing subsequences is through 
Young tableaux ([12], [27]). Recall that a Young diagram of size n is a col- 
lection of n boxes arranged in left-justified rows, with a weakly decreasing 
number of boxes from row to row. A (semi-standard) Young tableau is a 
Young diagram, with a filling of a positive integer in each box, in such a way 
that the integers are weakly increasing along the rows and strictly increasing 
down the columns. A standard Young tableau of size n is a Young tableau in 
which the fillings are the integers from 1 to n. The shape of a Young tableau 
is the vector A = (Ai, A2, A^), where Ai > A2 > ... > A^ and for each i, Aj 
is the number of boxes in the ith row while k is the total number of rows of 
the tableau (and so Ai + ■ ■ ■ + A^ = n). 



Xi, <Xi, < ■■■ <X,,, i.e.. 




LIn - 




When each Xi takes values 
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Let {1, 2, M} be an M-letter alphabet. A word of length is a map- 
ping W from {1,2,...,A^} to {1,2, ...,M}, and let [M]^ denotes the set of 
words of length with letters taken from the alphabet {1,2,..., M}. A word 
is a permutation if M = N, and W is onto. The Robinson-Schensted corre- 
spondence is a bijection between the set of words [M]^ and the set of pairs 
of Young tableaux {(P, Q)}, where P is a semi-standard Young tableau with 
entries from {1,2, ...,M}, Q is a standard Young tableau with entries from 
{1, 2, A^}. Moreover P and Q share the same shape which is a partition of 
A^. From now on, we do not distinguish between shape and partition. If the 
word is a permutation, then P is also a standard Young tableau. A word W 
in [M]^ can be represented uniquely as an M x A^ matrix with entries 

(^w)ij = lTy(i)=i- (1-1) 

The Robinson-Schensted correspondence actually gives a one to one corre- 
spondence between the set of pairs of Young tableaux and the set of matrices 
whose entries are either or 1 and with exactly a unique 1 in each column. 
This was generalized by Knuth to the set of M x A^ matrices with non- 
negative integer entries. Let Ai (M, A^) be the set of M x A^ matrices with 
nonnegative integer entries. Let P(P, Q) be the set of pairs of semi-standard 
Young tableaux (P, Q) sharing the same shape and whose size is the sum of 
all the entries, where P has elements in {1,...,M} and Q has elements in 
{1,...,A^}. The Robinson-Schensted-Knuth (RSK) correspondence is a one 
to one mapping between A4{M,N) and V{P,Q). If the matrix corresponds 
to a word in [M]^, Q is a standard tableau. 

Johansson [21], using orthogonal polynomial methods, proved that the 
limiting shape of the Young tableaux, associated to the iid uniform m-letter 
framework through the RSK correspondence, is given by the joint distribu- 
tion of the eigenvalues of the traceless M xM GUE. Since L/^r is equal to the 
length of the first row of the associated Young tableaux, the corresponding 
asymptotic results follow. The permutation case is also obtained by Johans- 
son [21] and, independently, by Okounkov [25] as well as Borodin, Okounkov 
and Olshanki [7]. Its, Tracy and Widom ([ISj, [IH]) further analyzed the 
independent non uniform framework. They showed that the corresponding 
limiting distribution of LIn can be written in terms of the distribution func- 
tion of the eigenvalues of the direct sum of mutually independent GUEs with 
an overall trace constraint. More recently, in [16] and [T7|, via simple proba- 
bilistic tools, the limiting law of the longest increasing subsequence for finite 
and countable alphabets is given as a Brownian functional. We investigate 
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below the connections between random matrix models and longest increasing 
subsequence problems, for finite alphabets, through these Brownian function- 
als. Our tools rely on basic probabilistic and combinatorial arguments. 

Let us now describe the contributions of the present paper. In Section 
2, we list some simple properties of a matrix ensemble, which we call gen- 
eralized traceless GUE; the proofs of these simple facts are relegated to the 
Appendix. We relate various properties of the GUE to this generalized one. 
In particular, it is shown that their respective spectrum differ by a multivari- 
ate degenerate Gaussian vector. In Section 3, we obtain the limiting length 
of the longest increasing subsequence of an inhomogeneous random word as q 
Brownian functional. This Brownian functional has thus the same law as the 
largest eigenvalue of the block of the M x M generalized traceless GUE, cor- 
responding to the most probable letters. In fact, a more general result with a 
multivariate Brownian functional representation for the limiting shape of the 
associated Young tableaux is actually obtained. Finally, the corresponding 
Poissonized measure is studied in Section 4. 

2 Generalized Traceless GUE 

In this section, we list some elementary properties of the generalized traceless 
GUE and defer the proofs to the Appendix. 

Recall that an element of the n x n GUE is an n x n Hermitian random 
matrix G = (G^ij)i<i j<„) whose entries are such that: Gi^i ~ A^(0, 1), for 1 < 
i<n, Re 7V(0, 1/2) and Im (Gij) ~ N{0, 1/2),' for 1 < i < j < n, 

and Gi^i, Re{Gij). Im{Gij) are mutually independent for 1 < i < j < n. 
Now, for M > 1, A' = 1,...,M and di,...,dK such that Y.k=idk = M, let 
Qm {di, dx) be the set of random matrices X which are direct sums of 
mutually independent elements of the dk x dk GUE, k — 1, K (i.e., X is 
an M X M block diagonal matrix whose K blocks are mutually independent 
elements of the dk x dk GUE, k = 1, K). Let pi, ■ ■ ■ ,Pm > 0, Yl!jLiPj ~ 1? 
be such that the multiplicities of the K distinct probabilities p'^^\ are 
respectively di, dx, i-e., let nii = and for k = 2, ...,K, let = Yl'jZi dj, 
and so Pm^+i = • • • = Pmk+dk = P^^\ k = I, K. The generalized M x M 
traceless GUE associated to the probabilities pi, ...,Pm is the set, denoted by 
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(2.1) 



where X G Gm (c?i, dx)- Clearly, from (12.11) . J2i=i VPi^'ii = 0- Note also 
that the case K = 1 (for which di = M), recovers the traceless GUE, whose 
elements are of the form X — tr(X)l7v//M, with X an element of the GUE 
and Im the M x M identity matrix. 

Here is an equivalent way of defining the generalized traceless GUE: let 
X'^'^) be the M x M diagonal matrix such that 



and let X G Gm {di,...,dK). Then, X^ := X - J2k=i X^'^ G 6^° (pi, ...,Pm). 



Equivalently, there is an "ensemble" description of (pi, ...,pm)- 

Proposition 2.1 X° G G^ (pi, ■■■,Pm) if and only ifXP is distributed accord- 
ing to the probability distribution 



on the space of M x M Hermitian matrices, which are direct sum of d^ x d^ 
Hermitian matrices, k = 1,...,K, XlfcLi '^fe = M, and where rrii = 0, rrik = 



Ejt? dj, k = 2, K. Above, C = Ti-T.^=,<^kid^~^)/^ and-f{dXl^, dX^^ j.^) 



is the distribution of an M -dimensional centered (degenerate) multivariate 
Gaussian law with covariance matrix 




(2.2) 




(2.3) 




S 



-VP2P1 I-P2 



— 



\-VPmPi 



y/PMPM^l 



^-PM J 
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We provide next a relation in law between the spectrum of X and of X°. 

Proposition 2.2 Let 'K e Qm {di, ...^dx), and let X.^ e {pi, ■■■,Pm)- Let 
C^r ' ' ) be the eigenvalues of X, where for each k — 1, ...,K, ^'"fc+i, • ■ ■ , 
^mk+dk gj,g eigenvalues of the kth diagonal block (an element of the d^ xdk 
GUE). Then, the eigenvalues ofKP are given by: 

M M 

a = r - v^E Vw^M = e - v^E Vk^'' ' = -' ^- 

1=1 1=1 



Let ^GUE Ml ^GUE Ml ^GUE M eigenvalues of an element of the MxM 

GUE. It is well known that the empirical distribution of the eigenvalues 

{CguemI^^^^ converges almost surely to the semicircle law v with 

density V4 — a;^/27r, —2 < x < 2. Equivalently, the semicircle law is also 
the almost sure limit of the empirical spectral measure for the fcth block 
of the generahzed traceless GUE, provided d}. — > oo, k — 1,...,K. This is, 
for example, the case of the uniform alphabet, where K — 1, di — M and 
= 1/M. 



Proposition 2.3 Let ■■■,C,q^ be the eigenvalues of an element of the 

MxM generalized traceless GUE, such that ^1^'°'^^, • ■ ■ , ^^^^'^'^ are the eigen- 
values of the kth diagonal block, for each k = 1, ...,K . For any k = 1, ...,K, 
the empirical distribution of the eigenvalues (Co/v^)mfc<i<mfc+(ifc converges 
almost surely to the semicircle law v with density y/A — x'^/27r, —2<x<2, 
whenever dfe — > oo. 



Now for pi,...,pM considered, so far, i.e., such that the multiplicities 
of the K distinct probabilities p''^\ . . . , p^^^ are respectively di,...,dK and 
Pmk+l = ■■■= Pmu+du = P^^\ k=l,...,K, let 

CP^'-'P^ := |x = (xi, Xm) e M^' : x^,+^ > ■ > Xm,+d„ k = 1, K; 

M ^ 

Ev^^. = . (2.4) 
j=i ) 
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In other words, £pi'---'Pa/ jg ^ subset of the hyperplane Ylj=i \/Pj^j ~ 0> where 
within each block of size dk, k = 1, the coordinates Xm^,+i, ■■■,Xmi,+dk, 

are ordered. For any si, ...,sm £ let also 

'^(.t:!sM):='C''''---'''"n|(xi,...,XM)G R'':x,.<s,, z = 1,...,m}. (2.5) 

The distribution function of the eigenvalues, written in non-increasing order 
within each x dk GUE, of an element of (pi, ...,Pm) is given now. 

Proposition 2.4 The joint distribution function of the eigenvalues, writ- 
ten in non-increasing order within each d^ x d^ GUE, of an element of 
{pi, ...,Pm) is given, for any si, ...,sm e R, by 

PfCo < 51,^0 < S2, ...,eo' < sm) = [ f{x)dxi ■ --dxM-i, (2.6) 

(Sl,...,3^,j) 

where for x = (xi, Xm) £ , 

K 

fix) := CA, J] A,(x)2e-S"i^>'/2l^p„...,p,,(x), (2.7) 

k=l 

with Cm = (27r)-(^^-i)/2]^f^jo!l!---(4-l)!)"^ and where Ak{x) is the 
Vandermonde determinant associated to those Xi for which pi = p^^\ i.e., 

mk<i<j<rrn.+dt: 

Remark 2.5 When the eigenvalues are not ordered within each dkxdk GUE, 
the identity l{2.6\) remains valid, multiplying cm, above, by Y[k=i ('^fe') ^ and 
also by omitting the ordering constraints Xm,^+i > ■ ■ ■ > Xmi^+dk, k = 1, K, 
in the definition of C^^'-'''^'^' . 

The forthcoming proposition gives a relation in law between the spectra of 
elements of Qm {di, ■■■■,dK) and of {pi, ...,pm)- 

Proposition 2.6 For any M > 2, /e^ X G Qm (di, d^) and let X° e 
{pi, ...,Pm) ■ Let^^,--- be the eigenvalues of X., and let ^q,--- be 
the eigenvalues o/X° as given in Proposition \2.S[ Then 

{e,---,e') = {eo,---,^n + iZi,---,ZM), 



7 



where {Zi,--- ,Zm) is a centered (degenerate) multivariate Gaussian vector 
with covariance matrix {y/PiPj) j^j^j- Moreover, (^q, ■ ■ ■ , ^q^) and {Zi, ■ ■ ■ , Zm) 
are independent. 

The asymptotic behavior of the maximal eigenvalues, within each block, 
of X° G {pi, ■■■,Pm) is well known and well understood (see also Theorem 
15.21 and Theorem 15.41 of the Appendix for elementary arguments leading to 
the result below). 



Corollary 2.7 For k = 1,...,K, let max be the largest eigenvalue 
of the dk X dk block o/X° G (pi, ...,pm), then 

max 

lim -= = 2, 

dk-xxi y dk 

with probability one, or in the mean. 



3 Random Young Tableaux and Inhomoge- 
neous Words 

Throughout the rest of this paper, let Am = c^m}, ai < ■ ■ • < c^m, be 

an M-letter ordered alphabet and let W = X1X2 ■ ■ ■ be a random word, 
where Xi, X2, ■ ■ ■ , X^v are iid random variables with P (Xi = otj) = pj, where 
Pj > 0, and X^jliPi ~ ^- ^et r be a permutation of {1, M} corresponding 
to a non-increasing ordering of pi,p2, ■■■,Pm, i-e., Pt(i) > ■ ■ ■ > Pt{m)- Assume 
also there are K = 1, ...,M, distinct probabilities in {pi,P2, ■■■,Pm}, and 
reorder them as p^^'^ > ■ ■ ■ > p'^^\ in such a way that the multiplicity of each 
p^''^ is dk, k = 1, ...,K. In our notation, K = 1 corresponds to the uniform 
alphabet case, where di = M. Let mi = and for any k = 2, ...,K, let = 
Yl^Zi dj and so the multiplicity of each p^-^) is dk if ruk < T{j) < ruk + dk, 
j = 1, M. Finally, let Xiy be as in (11. ip the matrix corresponding to such 
a random word W of length N. 

Its, Tracy and Widom ([H], [19]) have obtained the limiting law of 
the length of the longest increasing subsequence of such a random word. 
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To recall their result, let {^i,---,Cm) be the eigenvalues of an element of 
^° (Pr(i), •••,Pr(M)), written in such a way that (6,---,Cm) = ■■■,Cdl^ 

^J*-^, , i.e., 'Ci*! are the eigenvalues of the fcth block, k = 

1, ...,K. Then (see [19] ) , the limiting law of the length of the longest increas- 
ing subsequence, properly centered and normahzed, is the law of max ^^^^ 

l<i<dl 

A representation of this limiting law, as a Brownian functional is given in 
|16j . A multidimensional Brownian functional representation of the whole 
tableaux associated to a Markovian random word is further given in [T7] . 
Below, we recover the convergence of the whole tableau, in the iid nonuni- 
form case via a different set of techniques which is related to the work of 
Baryshnikov [1], Gravner, Tracy and Widom [13], Doumerc [TU] and |17j . 

Let -B^(t), be the M-dimensional Brownian motion hav- 

ing covariance matrix 



■.= t 



I Pt{1) (l - Pt{1)) -Pt{1)Pt{2) 

-Pt{2)Pt{1) Pt{2) (l - Pr(2)) 



-Pt(1)Pt{M) 
-Pr{2)PT(M) 



\ 



\ -Pt(M)Pt(1) 



-Pt{M)Pt{2) 



■ (3.1) 



Pt{M) (l -Pt{M)) J 



For each / = 1, M, there is a unique 1 < k < K such that Pr(i) = p^'^\ and 
let 



^k+d,^. k—rrn. 

Li, = j2B-^^\i)+ sup Yl E (^^^'^(^5- 



J{l-m^4k) 



(3.2) 



j=mfc+l i=l 



where the set J {I — m^, dk) consists of all the subdivisions (t*) of [0, 1], 1 < 
j < I — rrik, < z < (ifc, of the form: 

t) e [0, 1]; <ty,ti = Oforj<0-t) = lforj>M-k + l. (3.3) 

With these preliminaries, we have: 



Theorem 3.1 Let X{RSK(X.w)) = (Ai, Am) be the common shape of the 
Young tableaux associated to W through the RSK correspondence. Then, as 
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N ^ oo, 



Ai - Npr(i) Am - Npr(M ) 



fl f2_fl fM_ fM-1 



(3.4) 



Proof. Let (ej)^^]^ be the canonical basis of M^^, and let V = (Vi, Vm) 
be the random vector such that 

P(V = ej)=p,-, j = l,...,M. 

Clearly, for each 1 <p < M, 

E{V,)=pj, Var{V,)=p,{l-pj), 

and for ji ^ j2, CoviVj^, V^J = —pj^^pj^. Hence the covariance matrix of V 



IS 



(pj{l-Pj) -P1P2 ■■■ -PIPM \ 

-P2P1 P2 (1 - J52) ■ ■ ■ -P2PAI 



(3.5) 



\ -PmPi -PmP2 ■■■ Pm{^-Pm)) 

Let Vi, V2, Vn be independent copies of V, where Vi = (V^^i, Vi^2, ^,m), 
i = 1, ...,N. Then has the same law as the matrix formed by all the Vij 
on the lattice {1, A^} x {1, M}. 

It is a well known combinatorial fact (see Lemma 1 of Section 3.2 in [T2] ) 
that, for all 1 < / < M, 



Ai + . . . + A^ = G\M,N) := sup I Yl • e P(M, iV), 

I (i,j)67riU---U7r; 

and TTi, ...,7ii are all disjoint^, (3.6) 

where V{M, N) is the set of all paths tt taking only unit steps up or to 
the right in the rectangle {1, A^} x {1, M} and where, by disjoint, it is 
meant that any two paths do not share a common point in the rectangle 
{1, N} X {1, M}. We prove next that, for any k = 1, M, 



G\M,N)-Nsi N^oo fl 



N 
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L'm, (3.7) 



where s/ = Y!j=iPT{j)- For / = 1, 



G^(M, A^) = max I J] Vi^ ; vr e P(M, iV) \ . (3.8) 

Moreover, each path vr is uniquely determined by the weakly increasing se- 
quence of its M — 1 jumps, namely Q = < ti < ■ ■ • < tM-i < 1, such that 
vr is horizontal on [[tj-iA^J , L^j^J] x {j} and vertical on { \tjN\ } x [j, j + 1]. 
Hence 

M [tjN\ 

G\M,N)= sup J2 Yl 

0=to<tl<---<tA/_l<tA/ = l . |, ,,| 

J — i «=[tj_iA'J 

Let p^aa; = maxi<j<Af J9j, J(M) = {j : pj = p^ax} C {1, M} and so o?i = 
card {J{M)) (the ctj, where j G J{M), correspond to the most probable 
letters). As shown in [171 Section 5], the distribution of G^{M,N) is very 
close, for large A^, to that of a very similar expression which involves only 
those Vij for which j G J{M). To recall this result, if 

G\M,N) = sup E 

then, as ^ oo. 



= to < ti < • ■ ■ < tM-l < tAf = 1 *~r ■ Z ^r I 

tj-l = tj /or j ^ J{Af) J — -I- «— L^J-IJ^J 



Gi(M,iV) Gi(M,Ar) jp^^^ ^^ ^^ 



^A^ VA^ 

i.e., as A^ — oo, the distribution of the maximum (over all the northeast 
paths) in (13. 8p is approximately the distribution of the maximum over the 
northeast paths going eastbound only along the rows corresponding to the 
most probable letters. Now, 

— sup > 



^A^ o = to<ti<... ^ VA^ 

tj_i - tj for j ^ J{M) 

(3.10) 

We next claim that, as A^ ^ oo, for any t > 0, 



B\t) 

N I ^ ^ i<i<M 

l<j<M 



11 



where ( B^t) ] is an M-dimensional Brownian motion with covariance 

matrix tS. Indeed, for any t > 0, since Vi,V2,... are independent, each 
with mean vector p = (pi, ...,pm), and covariance matrix S, 

y-L*^J Y--tNv 



N V y i<j<M 

by the central hmit theorem for iid random vectors and Slutsky's lemma. 
Next, for any t > s > 0, 

^L(tp)A^J - L(t - s)N\p TILV Vi - [sN\p 



N VN 



B^it-s) AB^is) . (3.11) 

/ l<j<M \ / l<j<M J 

The continuous mapping theorem and Slutsky's lemma allow us to conclude 
that 



N VN 



B^it) AB^is) . (3.12) 

^ 1<J<A/ V / l<j<Af / 

The convergence for the time points ti > ^2 > ■ ■ ■ > > can be treated in 
a similar fashion. Thus the finite dimensional distribution converges to that 

of (B^it)) . Since tightness in (C([0, 1]*^, rf*^(-, ■)) is as in the proof of 
Donsker's invariance principle (e.g., see [5]), we are just left with identifying 

the covariance structure of the limiting Brownian motion ( B^ (t) ] . For 

V / l<j<M 

each iV, 



E 



VL*^J T/. 



\tN\p, 



N 



[tN\E{\V^,-p,f) 



N 



<tE{\V,,-p,f). 

(3.13) 
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Therefore, supE ( J^IZ i^ij - [tN\Pj) /VN ) < oo. As iV ^ cx), for 

iV>l V ' / 

each 1 < j < M, 

Moreover, for any ji ^ j^? by the continuous mapping theorem. 



Since 



sup I E 

Af>l 



E 



< sup I E 

7V>1 

< OO, 

therefore. 



(3.14) 



Cov(B=Ut),B^Ht)] = hm Cov 



^ VtN\ 
i=l 

= tCov 



(3.15) 



Hence the M-dimensional Brownian motion I it) ) has covariance 

V y i<j<M 

matrix tS with S given in (13.51) . In particular, as — cx3, for any t > 0, 



y-L*^J V- -tNv 
N 



B'Kt) 
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l<i<A/, j£J(M) 



It is also straightforward to see that the covariance matrix of \^B^ (t) 
is the di X di matrix 



^ Pmax (1 Pn 



Pr. 



\ 



Pmax 



r'max 
c (1 - P« 



Pmax 



By the continuous mapping theorem, 



Pmax 
Pmax 



Pmax (1 Pmax^ J 



jeJ{M) 



(3.16) 



G\M,N)-Np^ax N- 



N 



sup (3.17) 



and the right hand side of fl3.17p is exactly L\,j, then fl3.9p . leads to 

G^{M, N)-Np„ 



N 



^max N~>oo 1 1 



(3.18) 



Now, for / > 2, G\M^ N) is the maximum, of the sums of the V^jj-, over I 
disjoint paths. Still by the argument in [H], {g\M, N)-G\M, N^j /y/N 

0, as N ^ oo, where G\M, N) is the maximal sums of the Vij over I disjoint 
paths we now describe. Let 1 < A; < iiT be the unique integer such that 
Pt{i) = P^''^- Denote by 0:^(1), aj(m^) the letters corresponding to the rrik 
probabilities that are strictly larger than Pr{i)- For each 1 < s < m^, the 
horizontal path from (1, j(s)) to {N,j{s)) is included, and thus so are these 
ruk paths. The remaining / — disjoint paths only go eastbound along the 
rows corresponding to the dk letters having probability Pr{i)- The set of these 
/ — rrik paths is in a one to one correspondence with the set of subdivisions 
of [0, 1] given in (13.31) . Therefore 

rrik N 

G\M,N) = J2J2^r,rU) 

j=l i=l 

+ ,/^P,, E E E i^ltU^,Nl,rU)-Vlf 
J{l-m^,dk) 



' j=mk+l i=l [t^._.AfJ 



(3.19) 
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Now, 

&{M,N)-Nsi 



+ sup > 2^ 



(3.20) 



Since the column vectors V'^, V2, are iid, again, as iV — >• 00, for any 
t > 0, 

N j \ J l<j<M ' 

i<i<Af 

where [B^it)) is an M-dimensional Brownian motion with covariance 

V / l <j<M 

matrix given in (13.11) . Hence, (I3.20p and standard arguments give 

G'{M,N)-Nsi N^oo fi 



'N 

Finally, by the Cramer- Wold theorem, as — cxd. 



fl f 2 fM 

-^M5 -^M5 ■••5 I 5 



(3.21) 

therefore, as 00, by the continuous mapping theorem, 

Al - iVPr(l) A2 - iVPr(2) Am - iVPr(M ) 

' ViV ViV 



^AT VA^ VA^ 

^Mi ^I'l ~ ■■■^ ~ ^) ■ (3.22) 
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The proof is now complete. □ 

Remark 3.2 (i) Let (^^q, be the vector of eigenvalues of an element 

of (Pt(i), ■■■,Pt(m)) , written in such a way that ^^''^^ > ■ • • > 
for k = 1,...,K. Its, Tracy and Widom IT^ proved that the lim- 
iting density of {{\i-Npr{i)) /^/Np^, {\M-Npr{M)) /\/ Npr(M)) , 
as N —>■ oo, is the joint density, of the eigenvalues of an element of 
{Pt{i), ■■■,Pt{m)) , given by \2. 7| j. By a simple Riemann integral ap- 
proximation argument, it follows that 



Ai - Nprii) Xm - Npr(M ) 
Thus, from Theorem \3.1{ 



(n L 



2 _ f 1 jM _ fM-l\ A fdl cM\ (n 9q\ 



4 The Poissonized Word Problem 

" Poissonization" is another useful tool in dealing with length asymptotics for 
longest increasing subsequence problems. It was introduced by Hammersley 
in [13] in order to show the existence of lim^r^oo E (LI^) / \/N, for a random 
permutation of {1, 2, N}. Since then, this technique has been widely used, 
and we intend, below, to use it in connection with the inhomogeneous word 
problem. 

Johansson [21] studied the Poissonized measure on the set of shapes of 
Young tableaux associated to the homogeneous random word, while. Its 
Tracy and Widom [19] also studied the Poissonization of L/jv for inhomoge- 
neous random words. They showed that the Poissonized distribution of the 
length of the longest increasing subsequence, as a function of pi, ...,Pm, can 
be identified as the solution of a certain integrable system of nonlinear PDEs. 
Below, we show that the Poissonized distribution of the shape of the whole 
Young tableaux associated to an inhomogeneous random word converges to 
the spectrum of the corresponding direct sum of GUEs. Next, using this re- 
sult, together with "de-Poissonization", we obtain the asymptotic behavior 
of the shape of the tableaux. 
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Let W = X1X2 ■ ■ ■ Xn be a random word of length A^, with each letter 
independently drawn from Am = {^i < ••• < «a/} with (Xj = aj) = pj, 
i = 1,...,N, where pj > and Ylj=iPj ~ i-^-' random word is dis- 
tributed according to T'w,m,n = Pm x ■ ■ ■ x on the set of words [M]^. 
Using the terminology of [5T], with N = {0, 1, 2, ■ ■ ■ }, let 

r M 

'Pm^ := U = (Ai, Am) G N^^ : Ai > ■ ■ ■ > Am, A, = iV 
I i=i 

denote the set of partitions of A^, of length at most M. The RSK correspon- 
dence defines a bijection from [M]^ to the set of pairs of Young tableaux 
{P,Q) of common shape A G V\IJ\ where P is semi-standard with elements 
in {1, M} and Q is standard with elements in {1, A^}. 

For any W G [M]^, let S{W) be the common shape of the Young tableaux 
associated to W by the RSK correspondence. Then S* is a mapping from [M]^ 
to 'PI^\ which, moreover, is a surjection. The image (or push- forward) of 
^w,M,N by S is the measure Pm.a^ given, for any Aq G 'P[^\ by 

^M,N (Ao) := ^W,M,N 

(A {RSK{Xw)) = Ao) . 

Next, let 

Vm := {a = (Ai, Am) G N^' : Ai > ■ ■ ■ > Am} , 

be the set of partitions, of elements of N, of length at most M. The set Vm 
consists of the shapes of the Young tableaux associated to the random words 
of any finite length made up from the M letter alphabet Am- 

For a > 0, the Poissonized measure of T'm,n on the set Vm is then defined 

as 

00 jY 

P^(Ao):=e""5^PM,Ar(Ao)^. (4.1) 

N=0 

The Poissonized measure P°^ coincides with the distribution of the shape of 
the Young tableaux associated to a random word, taking its values in the 
alphabet Am and whose length is a Poisson random variable with mean a. 
Such a random word is called Poissonized, and Lla denote the length of its 
longest increasing subsequence. 

The Charlier ensemble is closely related to the Poissonized word problem. 
It is used by Johansson [5T] to investigate the asymptotics of LI^ for finite 
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uniform alphabets. For the non-uniform alphabets we consider, let us define 
the generalized Charlier ensemble to be: 



M ^ M 

A? 



l<i<j<M j=l ^ j •' i=l 

(4.2) 

for all A° = (A°, Ag, X^j) G Vm, and where sao(p) is the Schur function of 
shape A° in the variable p = {pt{i), ■■■,Pt{m)) which we describe next. Let 
^1, ...,Ak be the decomposition of {1, M} such that Pr{i) = PtQ) = P*-^^ if 
and only if i,j G Ak, for some 1 < k < K. Clearly, dk = card (Ak)- Then, 



"Sao(p) = J? 5 (4.3) 

where Sm is the set of all the permutations of {1,...,M} and where hi = 
A° + M-i for i = 1,...,M. 

The next theorem gives, for inhomogeneous random words, both T'm,n{Xo) 
and the distribution of Lla- The first statement is due to Its, Tracy and 
Widom ([H], [in]), while the second follows directly from the fact that the 
length of the longest increasing subsequence is equal to the length of the first 
row of the corresponding Young tableaux. 



Theorem 4.1 (i) On [M]^ , the image (or push- forward) of ¥w,m,n by 
the mapping S : [Mf pjf ^ zs, for any A" = (A?, A", A^.,) G vS\ 
given by 

^MA>^')=sAp)f°- (4.4) 

Above, is the number of Young tableaux of shape A° with elements 
zn{l,...,N}: 

M 

l<i<j<M j=i ^ 1 ' -J' 

and sxo(p) is the Schur function of shape \^ in the variable p = (pt(i), 
Pt{m)) given in ( [^.3| ), with r a permutation of {1, M} corresponding 
to a non-increasing ordering of pi,p2, ■■■,Pm ■ 
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(a) The Poissonization ofFM,N is the generalized Charlier ensemble m 
defined in ( [^.^ . In particular, for the Poissonized word problem, 

Ar=0 



For uniform alphabet, Johansson [2T] obtained the convergence, as a — >■ 
oo, of the Poissonized measure on Vm to the joint law of the ordered eigenval- 
ues of the GUE. Next, following his lead and techniques, we generalize this 
result to the nonuniform case, where the convergence is towards the joint 
law of the eigenvalues (^i, ...,^m), ordered within each block, of an element 
of Qm {di, dx). The density of (^i, ^m) is, for any x e M^, given by 

= ^CM^A,(x)vs"^^^/^ (4.6) 

k=l 

where cm = (27r)-(^^-^)/2 nf=i (0!l! ■ ■ ■ (4 - and where 

mk<i<j<mh+dk 



Theorem 4.2 Let \{RSK{^w)) = (Ai,...,AAf) be the common shape of 
the Young tableaux associated to W through the RSK correspondence. Let 
i^iy-y^u) be the eigenvalues of an element of Qm {di, ...jdx), written in 
such a way that ^m^+i > ■ ■ ■ > ^mk+dk for k = 1, K , and let /gi,...,^^,^ be its 
density given by (^7^. Then, for any continuous function g 



on 



r TIT" I I Al-aPr(l) AM-aPr(Af) , , , , w / N , 



o— >oo 



(4.7) 

Proof. By Theorem |0, for any partition A° = (A?, A^, A^^) of G N, 
FmAHRSKCXw)) = A°) = sao(p)/^°. 
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where 

M 



l<i<j<M j=l ^ J ' •> I 

and where s\o{j)) is the Schur function of shape A° in the variable p 
as given in fl4.3p . Hence the Poissonized measure is 



M 



Ar=0 l<j<i<A/ i=l ^ J 

Next, for z = 1, ...,M, let 



then, as a — > oo, 

M „ / M \ 

^ .27rr^/2— TT»"^'^"^ e-^-i^?/2 

^jAO + M-j)! ^ ^ «^ Vr=i '^^^ / 

(4.8) 

and 

n (a°-a;+,-.) 

i<«<i<A// 



k=\ k<l 



fc=l k<l 

(4.9) 

Together with 

E/iNo-TT TT / M--(7(i)--mfc-(ife+r(i) , mfc+dfc-T{i)\ 
^ 11 11 \Pr{i) ) 

M K K 

i=l k=l k=l 

(4.10) 
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the limiting density of ((Ai - ap^(i)) /^apr(i), (Am - aPr(M)) / yjaPriM)) , 
as a — > oo, is 

K 

fc=i 

which is just the joint density of the eigenvalues, ordered within each block, 
of an element of Qm {di-, dx)- The statement then follows from a Riemann 
sums approximation argument as in [21]. □ 

The next result is concerned with " de-Poissonization" , and again is the 
nonuniform version (with a similar proof) of a result of Johansson. 

Proposition 4.3 Let = N + 3VN \nN and (3n = N - 3VN \nN. Then 
there is a constant C such that, for sufficiently large N, and for any < 
ni<N,i = l, ...,M, 

(Ai < ni, Am < um) " ^ < ^m,n (Ai < ni, Am < um) 

< Fjf (Ai < rii,. .., Am <nM) + ^. 

(4.11) 

Proof. The proof is analogous to the proof of the corresponding uniform 
alphabet result, given in [21] (see also Lemma 4.7 in [7]). It suffices to prove 
that fM,N (Ai < ni, Am < is decreasing in A^. Denote a random word 
in [M]^ by W^^^ = X1X2 ■ ■ ■ and let [M]^+i(j) be the set of the random 
words = X1X2 ■ ■ ■ XnXn+i such that X^+i = j, j = 1, M. Each 

word in [M](^+i)(j) is mapped into a word Fj e [M]^, by 

deleting the last letter X^+i. Clearly, Fj is a bijection from [M]^~^^{j) to 
[M]^. Moreover adding the letter Xjy+i can only increase the length of a 
row. Therefore, for alH = 1, M, 

A,(F, < A, (iy(^+i)). 

Now, let g{W^^^) = 1, if Xi{W^^^) < rii, for all i = 1,...,M, and let 
g (W^^^) = 0, otherwise. We have 
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Hence, 




^M,N+i (Ai < ni, \m < um) 

iy(JV+i)g[M]JV+i 
M 

vi/(Jv+i)e[A/]Jv+i(j) 

Vi/(JV+i)g[jv/]JV+i(j) 

M 

= Pm.jv (Ai < ni, Am < hm) • (4.12) 

This completes the proof. □ 

We are now ready to obtain asymptotics for the shape of the Young 
tableaux associated to a random word W G [M]^, when M and go to 
infinity. Before stating our result, let us recall the well known, large M, 
asymptotic behavior of the spectrum of the M x M GUE (|28], [29], [2T]). 

Theorem 4.4 ( Tracy- Widom) Let ^qu^ m largest eigenvalue 

of an element of the M x M GUE. For each r > 1, there is a distribution 
function on W , such that, for all (ti, G W , 

hm ¥guem khuEM < 2Vm + ^,/Ml/^ j = 1, ...,r) = 

Remark 4.5 T/ie multivariate distribution function F^ originates in 12^ 
and another expression for it is also given in IMf (see (3.48) there). For 
each r = 1, 2, ■ ■ ■ , the first marginal of F^ is the Tracy-Widom distribution 
Ftw- 



Once more, our next theorem is already present, for uniform alphabets, 
in Johansson [2T] . 
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Theorem 4.6 For each r > 1, let Fr{ti, ...,tr) on W be the distribution 
function obtained in Theorem \4-4\ Assume that di +00, as M ^ +00. 
Then, for all {ti, ...,tr) G W , 

lim limFlJXj < apmax + diapmax + tjd^^^^ y/ap^ax, j = 1, ■■■,r 

= Fr{ti,...,tr), (4.13) 

and, 

lim lim Fm,n{ \ < Npmax + 2 V dlNpmax + tjdi^^^\/Npmax,j = 1, r I 
di— >oo N^oo \ J 

= Fr{ti,...,tr). (4.14) 

In particular, for any t G M, 
lim lim Fw,M,NiLlN <Npmax + 2 diNpmax + td^^^^^y Npmax ) =FTw{t)- 

di— >ooA'— >oo \ / 

(4.15) 

Proof. By Theorem 14.21 for each r > 1, and for all (si, Sr) € W, 

lim P^,,, f >^J-^Pmax ^ ^ ^ ^ \ ^ p^^^^^^ ^ ^ ^ ^ 

V ^/<^Pmax J 

(4.16) 

where is the jt/i largest eigenvalue of the di x di GUE. Hence, for any 

(tl, tr) G IR*", 

lim P^^ (Xj < apmax + 2a/ diapmax + tjd^^^^y/apmaxj = 1, •••,?") 

= lim P^, fh^^F^ < 2v/rf;+ t,rf-^/^ J = 1, r) 

= P (0 < 2v^ + ^,rf^'/^J = l,...,r) . (4.17) 

As cii ^ 00, Theorem 14.41 gives the first conclusion, proving (14.131) . Next, by 
Proposition iJl with on = N + Sy/NlnN and (3^ = N - SVNlnN, there 
is a constant C such that, for sufficiently large A^, and for any < Sj < N, 

Pa^ (A, < sj,j = 1, ...,r) - ^ < Pm,^ (A,- < Sj,j = 1, ...,r) 

<Pff (A, <.„j = l,...,r) + ^. (4.18) 
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Next, N = {1- ttTv, with e„ = 3^/N\ogN / [N - 3y^N \ogN), whereas 
N ={l + Efs) Pn with ep = 3y/N hgN/ {N + 3y/N \ogN) . Since £„, 0, 
as — > (X), it follows from fl4.18p . by setting Sj = PmaxN + 2\/diPmaxN + 
tjdi^^^y/PmaxN, that 

lim P^/ (Xj < aNPmax + 2 a/ diaNPmax + tjd^^^^ ^aNPraaxd = 1, ■■■,r) 
N—*oo \ / 

< lim Fm^n (Xj < Npmax + 2 a/ diNpmax + t jd^^^^ ^/ Npmax , j = '^,---,r) 

< lim Pff (Xj < ^NPmax + '^\/dif3NPmax + tjdi^'^^/l^NPmaxd = l,---,?^) • 

(4.19) 

Now, 04.171) holds true with a replaced by a^- or /^at. Finally, fl4.14p follows 
from (I4.19P by letting di ^ oo. □ 

Remark 4.7 (i) The convergence results in Theorem \4.6\ are obtained by 
taking successive limits, i.e., first in N and then in M. For uniform 
finite alphabets, in which case di = M, Johansson f21f had previously 
obtained the convergence, towards Fi, for the length of the longest in- 
creasing subsequence, via a careful analysis of corresponding Kernel 
and methods of orthogonal polynomials. His results, which are for si- 
multaneous limits, require (In N)^^'^ / M — > and a/ZV /M oo. Also in 
the uniform case, under the assumption M = a (^N^^'^(lio.N)~^^'^^ , the 
convergence result ^.14^ is obtained in for simultaneous limits, via 
Gaussian approximation and a method originating in Baik and Suidan 
and Bodineau and Martin f^. Non-uniform results are also given 
m If. 

(a) In the permutation case, Baik, Deift and Rains and Widom 13^ 
proved the convergence of the moments of the rows of the associated 
Young tableaux. In the finite alphabet case, this convergence also holds. 
First, the convergence of the moments for the eigenvalues of the GUE 
towards the moments of Tracy- Widom distribution is well known. Then, 
since the eigenvalues of the traceless GUE and of the corresponding 
GUE only differ by a centered multivariate Gaussian vector, the con- 
vergence of the moments for the eigenvalues of the traceless GUE also 
holds. We are thus only left with proving the convergence of the mo- 
ments of ((Ai - A^Pr{i)) / a/A^Pt{i), (Am - Np^(^M)) /^/Np^^, to- 
wards the eigenvalues of the traceless GUE, as N —>■ oo. This, for ex- 
ample, can be proved through a concentration argument, as illustrated 
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in Section 2.3 of 12^ . which we describe next. For each i = 1, ...,M, 
let Xi{W) denote the length of the ith row of the Young tableaux associ- 
ated to the random word W = Xi- ■ ■ X^. Then, on the product space 
{1, ...,M}^, Xi is a Lipschitz function in that \Xi{W) — Xi(Y)\ < 1, as 
long asW = Xi--- Xn and Y = Yi- ■ - Yn differ only by one coordinate. 
Thus, by a standard concentration inequality, for any t > 0, 



P(|Ai -EAil >t)< 2e 
Therefore, for any q > 0, 

Xi - Npr(i) 



2N 



(4.20) 



E 



^Npr(i) 

< F{\Xi-EXi\ + \EXi - Nprii) I > v/A^t^/^) dt 
Jo 



< 



EAj - Nprii) 



^Npr(i) 

Now, for any gi, qu > 0, 

E 



+ 2 



dt 



(4.21) 



Ai - A^p^(i) 


91 


Am - Np^(M) 


^/Npril) 




^Np^^M) 



p 




M 

i=l 
AI 



Ai - Np^(i) 



(1) 



<?i 



A 



M 



Npr(M) 



(M) 



>t\dt 



P 



Aj - iVp^(j) 



< 



i=l 



Mqi 



EXi - Np^d) 



+ 2 



>t \ dt 



.Mil 



'dt 



(4.22) 



//sup |EAi — NpT-(i) I / a/ NpT-(i) < +00, which is the case if all the prob- 



N>1 



abilities pi, ...,pm are distinct (see fl^). it will follow that 



supE 

Ar>l 



Ai - Npr(l) 



qi 



Xm - Npri^M) 



(M) 



qM-\ 
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which, when combined with weak convergence, gives the convergence of 
the moments of the rows of the tableaux. 



5 Appendix 

We provide here the proofs of the various properties of the generahzed trace- 
less GUE hsted in Section 2. 

Proof of Proposition 12. 11 , Let X G Qm (di, dx), and let X° be given as 
in ([21]). Then, 

(-^1,1' -^2,2' -^M,A/) = ^0 (Xi_i, X2,25 •••5 Xjv/,A^) • 

Since (Xi,i, ...,Xm,m) ~ N(0,Im), and S °S ° = S o, it follows that 

(XO„...,X°,,^,) ~N(0,S°). 

Now, the off diagonal entries of X are independent of the random variables 

- y^X)?i v^X/^z, ...,Xm,a/ - ^/PMY.fiiVPi^i,h and thus the dis- 
tribution of X° is given by fl2.3p . On the other hand, suppose the matrix 
X° is distributed according to the probability distribution (12.31) . Clearly, 



the diagonal entries X'j' 
Moreover, 



X 







5 ■'^MM 



are independent of the off diagonal ones. 



E 



M 



, i=l 



E 



M 



E v^v^M^L 

Pi, \/Pm) S° (v^, VPju)' 

M M 

Epi(i-Pi) -EE^^-Pj' 

i=l i=l jy^i 

M M 

E^^-E^* 

i=l 

0. 



2=1 



Therefore, Y.tU VP^h = 0- Since (X? i, X.Ij m) ~ N(0, S") and S^S 
there exists a vector {Zi, Z^) ~ N(0,Im) such that 

(X° 1, ...,X°^^^j) = S° (Zi, Zm)' , 
26 



(5.1) 

OvO _ 



and, moreover, the vector {Zi, Zm) can also be chosen to be independent 
of the off diagonal entries of X°. Let X be the matrix X° with the diagonal 
entries X^ i, ...,X°|^j replaced by Zi, Zm- Then X G Qm {di, dx) and 
X° is given as in fl2.ip . □ 



Proof of Proposition 12.21 It is clear that = C ~ \/Pi^i=i \/pP^i,h 



d 

i = 1, M, are the eigenvalues of X°. Next, to prove that X^i^Ii y/Pi^i,i 
Xlilli ^/Pi^\ let Xp be the M x M matrix obtained by multiplying the kth 
diagonal block of X by \/p^. For each i = 1, M, there exists a unique 
1 < k < K, such that ruk < i < mk + dk-, and is an eigenvalue of the 
A;th diagonal block of X. Moreover, pi = p^^\ thus y/plC is an eigenvalue of 
the kth diagonal block of Xp, which is an eigenvalue of Xp as well. Then, 
v^^^) \/VMi^ are the eigenvalues of Xp. □ 

Proof of Proposition 12. 3i By Proposition 12.21 there exists an X G 
Qm {di, dx), whose eigenvalues ■ ■ ■ , satisfy, 

M M 

Co = e- = ^'- VviY.^^^'^ ' = 1' 
1=1 1=1 

If dk ^ oo for some k = 1,...,K, by Wigner's theorem [24], the spectral 
measure of {C/Vdk)^^^^^ converges weakly to the semicircle law u 
almost surely, i.e., for any bounded continuous function / : M ^ M, 

almost surely. Now, J^iLi \/Pi^i,i ~ ^(0; 1); ^"^^ P^^Vdk ^ as cifc oo, 
hence, 

\/dl 
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Next, for any bounded Lipschitz function /, almost surely, 



< 



i=mk+l 

- T f 

i=mk+l 



1 



fdu 

rrik+dk 



dkj dk 



i=mj.+l 



1 



rrik+dk 



i—nik+l 



'dk 



fdu 
(5.2) 



Now, 



rrik+dk 
i=mfe+l 



i=mj.+l 



< 



Lip 



0, 



(5.3) 



and the proposition is proved, since the bounded Lipschitz functions form a 
determining class for weak convergence ([HI Section 9.3]). □ 

Proof of Proposition [2741 From Proposition 12.21 



M 



1,...,M, 



1=1 



where • • • are the eigenvalues of an element of Qm {di, ...,dK), and 
where ^'"'=+^ > ■ ■ • > are the eigenvalues of the kth diagonal block (an 

element of the dk x dk GUE), for each k = 1, ...,K. Clearly, "^fli y/Piio = 0- 
Let us now compute the joint density of ^q, ■ ■ ■ , ^q^~^- Recall that the joint 
density of {^^, - ■ ■ , C*^) is, for any x G M*^, given by 



(5.4) 



k=l 



where cm = (27r)-(*^-i)/2 nf=i (0!1! ■ ■ ■ (4 - 1)!)" , and where 

mk<i<j <.'mk+dk 
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with again mi = and = Yl^=i dj, for k = 2, K. Consider the change 
of variables from {^\--- , ^*^) to (^i, ■ ■ ■ , Y) , where 



Y 



Then, 



det(J) 



det 



-y/PM-l^/Pl 

Pi 



M 

1=1 



piVp2 ■■■ -VpiVpm \ 

1 - Pm-1 -^Pm-i^Pm 



det 



y/PM-l y/PM ) 

/I \ 

1 ■•• ■•• 

■■■ 1 

\^/Pl ^Pm-1 ^/Pm) 

= \fPM-, (5.5) 

where J is e Jacobian of this transformation. Thus, the joint density of 
[il--- .i^'-\Y) is given by: 

1 



1 



CMC 



l<j<j<M 



-1,2/) 



(5.6) 



X 



□ 



where x° = {x\, ...,x\^ and x^j = ~^Y^j=\ Integrating y from — cxd to 
oo, shows that the joint density of (^qj ' ' ' 5^o^~"^) is /( 

Proof of Proposition 12.61 For any Si, Sa/ G 

P(e'<Sl,-,e*'<SA/ 



■ .. / w^,[x)\-^^^-y'dx,---dxM. 



(5.7) 
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where cm = (27r)-(*^-i)/2 nf=i (0!l! ■ ■ ■ (4 - 1)!)"^ Consider the change of 
variables 



M 



Clearly, ^f^-^ = 0. With C^l'-^Jl^^ as in ([23]), we have 

P(e'<5i,-,e*'<SA/) 



(5.8) 



^27r 



M 



E 



e^^P (^0 < si - v^Pil/, ■ ■ ■ , < - VPmI/) 



oo v27r 

P fed < ^^1 - ■ ■ ■ ,eo' < ^A/ - V^^' 



X 



(5.9) 
M sr^M , — 



where dx^ is the Lebesgue measure on 

= o|. The right hand side of (15.91) is the distribution function of the sum of 
the mutually independent random vectors (^q, • ■ ■ ,^o^) and (Zi, ■ ■ ■ , Zm)-, 
where (Zi, ■ ■ ■ , Zm) = {^/Pl, ^/Pm) Z with Z ~ A^(0, 1). □ 



Let Q^jjE M be the maximal eigenvalue of an element of the M x M CUE. 
Below, we give a simple proof of the fact that ^^c/e m/ ~^ 2 almost 
surely. This proof is based on a " tridiagonalization" technique originating in 
Trotter [31] (see also Silverstein [26] where similar ideas are used). Our first 
result is the well known Householder representation of Hermitian matrices. 

Lemma 5.1 Let G = {Gi,j)i<i,j<M be a matrix from the GUE. Then, there 
exists a unitary matrix U, such that 

( XM~i ■ ■ ■ \ 

XM-1 ^2,2 XM-2 • • ■ 



UGU* 








X2 ^M-l,Af-l Xl 

Xl Am,m J 



(5.10) 
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where Ai^i, Am^m ore independent A^(0, 1) random variables, and for each 
1 < k < M — 1, XM~k has a chi distribution, with M — k degrees of freedom. 
Moreover, for each k = 1, ...,M — 1, Ak^k is independent of XM-k, Xi- 



Theorem 5.2 As M oo. 



max 
GUE,M 



M 



almost surely. 



Proof. By Lemma [5.11 there exists a unitary matrix U, such that 



/ Ai,i Xm-1 

XM-1 ^2,2 XM-2 



T := UGU* 



\ 





V 








X2 Am-i,m-i Xi 
Xi Am,m J 



(5.11) 



where Ai^i, Ak^k are independent A^(0, 1) random variable, and for each 
l<k<M — l,Xk has a chi distribution with k degrees of freedom. Clearly 
G and T share the same eigenvalues. 

By the Gersgorin circle theorem (see [I5]), for any eigenvalue of G, 
letting also xo = Xm = 0, 

6e U [Ak,k — XM-k+l — XM-k, Ak^k + XM-k+l + XM-k] ■ 



k=l,...,M 



Hence 



k=l,...,My^yM VM VM 
For each k = 1, M, Ak^k ~ ^(O, l), thus, for any fixed e > 0, 



max I \ 

GUE,M , / Ak^k , XM-k+l , XM-k 

< max I , + 



(5.12) 



P 



max 



Ak,k 



k=l,...,My/M 



M 

> e] < ^P 

k=l 



M 



Therefore, X]m=i ^ 

0. Next, for any fixed e > 



max Ak k/V M 

k=l,...,M ' 



(5.13) 



>e\ < oo, and thus max Ak^k/vM 
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p 



max — — — 1 

k=l,...,M M 



> e 



P ( max xl < M(l - e 

k=l M " ^ 



P ( max xi > M(l + e) 

,k=l....,M ^ ' 



< P (xL < M{\ - e)) + MP (xL > M{\ + £)) . 
Now, for any y > 0, 

M_i _u 



(5.14) 



2e-%^ E n 



\ k=\ 1=1 
where M > 2. For y = M(l + s), 



+ 2(M-2)!! / a;^°^°^(2)e-^dx, (5.15) 



P(xL>M(l + £)) - 



1 



r (f ) 2f yM(i+, 



/ M 2 e 2 aw 



^ Me — (M(1 + e))T 2e — t"(M - 2)!! 



r(f) 



r(f) 



But, as M — > oo. 



Me 



M(l+e) 



(M(l + £))^ (1 + £)^ 



r(f)2 



and 



(M-2)!!~r{'|^^) 2f 



(5.16) 



thus ^~^iMP(xi^ > M(l + £)) < oo. Next, since f{u) = wf-^e't is 
increasing for < M — 2£, we have for M > 2: 



P (xL < M(l - £)) < 



M ;v7" Mir 

(M(l -£))^e-ir+^ 

£r (f ) 2f 



(5.17) 



32 



and by Stirling's Formula, Em=2P(Xm < ^{1 - e)) < oo. Therefore, 

Em=i P ( " ^ > < OO, ^ max^xl.-.+i/M ^ 1, and al- 



most surely. 



^rnax 



linisup^^^<2. (5.18) 

M^oo V M 

Next, since the empirical distribution of the eigenvalues ( iauE mI ) 

\ ' / l<i<M 

converges almost surely to the semicircle law v with density -\/4 — x^/2'k. We 
claim that, for any £ > 0, 

(Cmax \ 
liminf > 2-£ 1 = 1, (5.19) 

P (hminf e^^-^V^ > 2 - < 1, 

\ M— »oo ' / 

P(liminf _^/Vm < 2 - > 0, 



If not, i.e., if 
then, 
and thus. 



p(liminfe^^^,jv,/VM<2-£,i = l,...,M) > 0. 

Let the event := |HminfM^oo Cgc/b,m/V^ < 2 — £, i = 1, m| and 
consider the bounded continuous function 

{1, X <2-e\ 

^, 2-£<x<2; (5.20) 

0, x>2. 

Then, on A,, Y.f=i fe {Cgue,m/^) = 1 and / f.du < 1, and so 
i=i 
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which is clearly a contradiction. Letting e — in fl5.19p yields, 



^max 

liminf ^^fl*^ > 2. a.s. (5.21) 



Therefore, combining flS.lSp and fl5.21l) . ^gue mI^^ ~^ ^ almost surely. □ 
To prove our next convergence result, we first need a simple lemma. 

Lemma 5.3 For each k = 1, let xl chi-square random variable with 
k degrees of freedom. Then, 

(max xV 
^^''-'^ I = 1. (5.22) 

Proof. First, 



E (,i^ax^/') > E ixlf) = M. 



Next, by the concavity of the logarithm, for any < t < 1/2, 



tE 



max xl 

k=l,...,M 

M 



<i^ln lyEe'^l] 



< — In I M- ^ 



M V (1 - 2t)^^/' 

iln(l-2t). (5.23) 



M 2 

Hence, 

max Yfc » . 
thmsupEl <--ln(l-2t) 



and letting t — 0, 



M^oo 



M / - t^o 2t 
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(Since -ln(l - 2t) < 2t + At^, for < t < 1/3, taking t = ^In M/2M in 



(1523]), will give E ( max xl/M ) < 1 + 2^2 In M/M, for M > 10. 



□ 



Again, in the uniform finite alphabet case, where pi = ■ ■ ■ = Pm = ^/M, 
we have K = 1, di = M. For k = 1,...,M, and to keep up with the 
notation of [17], denote by H^^ the particular version of L^, as in (I3.2p . 
Let B'^it), B^\t)^ be the M-dimensional Brownian motion having 

covariance matrix 

/ 1 p ■■■ p \ 
p 1 ■■■ p 



t 



(5.24) 



y p p ■■■ 1 

with p = -1/(M - 1). Then, for k = I, M (see also [IT], [lO]) 



M 



M 



M k 



M 



i=l p=l 

where the sup is taken over all the subdivisions (tf) of [0, 1] as in (13.31) . As 
a corollary to Theorem 13.11 (see also [I6]), for each M > 2, 



Moreover, the following convergence, in L^, holds. 

M X M traceless GUE. As M oo, 

"Tnaxfi 



^GUE,M' ^GUE,M^ ^GUE,M ) • (5.25) 



Theorem 5.4 Let ■C^jy^ &e the maximal eigenvalue of an element of the 

^GUE,M 



M 



Equivalently, 
Equivalently, 



max 
GUE,M 



M 



in . 



in . 



M 



in L^. 
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Proof. Note that when = ■ ■ ■ = = l/M, vC(^';;;;;^j^), given by ([23D is 
the empty set when si < 0. Hence ^q'^^^m nonnegative (this is actually 
clear from the traceless requirement). By Theorem 13.11 H]^ and ^q'^^^m 
equal in distribution, and so it suffices to prove that, as M — > cxd, 

^ \SGUE,M) 



2. (5.26) 



Next, by Proposition 12.61 ^ {Cgu'eki) ~ ^ {^gu%,m\ Moreover, taking ex- 
pectations on both sides of fl5.12p gives: 

E {^guem) < E max^^Afc,fc^ + E max^XM-fc+i^ + E max^XM-fc 



It is well known (see [23]) that 



while, by Lemma [5.31 



E ( max Akk] < V2 In M, 

k=l,...,M 



limsup E ( max 1 = 



leading to 



Now, ^q'^em is nonnegative and by Theorem 15.21 ^jj^emI^TM — > 2, almost 



hmsupEl^^l <2. 



surely. Thus, by Fatou's Lemma, 



>E hminf =2, 



and so, limM^oo E (^^gue!m/ "/M^ = 2. Using once more the fact that ^que m 



is nonnegative, we conclude that limM^ooE ^gue^m/^ ^ ~ ^ 
the weak law of large number, limM^oo E ^qIj%; mI — 2 



= 0, and by 
0. □ 
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Remark 5.5 A small and elementary tightening of the arguments of David- 
son and Szarek (see also W^) will also provide an alternative proof of 
Theorem\5.4' 



Proof of Corollary 12.71 By Proposition \2.2\ 



M 



a= max e-Vp^J2vpi^^,i- 



max 

Since max ^* is the maximal eigenvalue of an element of the dk x dk 
GUE, with probability one or in the mean, limd^-*oo max C,^/ \fdk = 2. 
Moreover, Y2ff^i ■^/piX.i^i is a centered Gaussian random variable with variance 
Var ^X]f=i VPi'^i,iJ = 'YaLiPi = 1- Hence, with probability one or in the 
mean, lim^^^oo a/p^ YaU \/Pi^i,i/ Vdk = 0. □ 
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